Modified Privacy Preserving Data Mining System for Improved Performance
نویسنده
چکیده
Privacy of information and security issues now-a-days has become the requisite because of big data. A novel framework for extracting and deriving information when the data is distributed amongst the multiple parties is presented by Privacy Preserving Data Mining (PPDM). The concern of PPDM system is to protect the disclosure of information and its misuse. Major issue with PPDM that exists is to use the coherent data mining algorithm for preserving privacy of data. Various PPDM techniques have been proposed till now. One of them is DPQR (Data Perturbation & Query Restriction) algorithm, which is implemented only on Boolean data. In proposed approach, the SVD (Singular value decomposition) data perturbation technique is applied for data modification; discretization of raw data is done and generates the perturbed/distorted data. SVD technique improves the level of privacy protection by providing higher degree of data distortion. The algorithm is applied on perturbed data with association rule mining and Hamiltonian matrix concepts to find out frequent itemsets. By this the confidential data is preserved. The performance metrics for the approach are privacy level, efficiency, scalability and data quality. The time required for calculating matrix inversion is reduced by using Hamiltonian matrix. Main performance metrics is the privacy level and the privacy preserving degree is improved by dimensional reduction based perturbation i.e. multi dimensional perturbation (SVD and NMF) data perturbation techniques. The novelty of the proposed method is, it is being applied on numeric data and expected to achieve comparable parameters as shown on Boolean data.
منابع مشابه
A Improved Privacy Preserving Algorithm Us- Ing Association Rule Mining in Centralized Da- Tabase
The recent advancement in data mining technology to analyze vast amount of data has played an important role in several areas of Business processing. Data mining also opens new threats to privacy and information security if not done or used properly. The main problem is that to hide sensitive information, including personal information, fact or even patterns which are generated by any algorithm...
متن کاملFast Cryptographic Privacy Preserving Association Rules Mining on Distributed Homogenous Data Base
Privacy is one of the most important properties of an information system must satisfy. In which systems the need to share information among different, not trusted entities, the protection of sensible information has a relevant role. A relatively new trend shows that classical access control techniques are not sufficient to guarantee privacy when data mining techniques are used in a malicious wa...
متن کاملA Fast Secure Dot Product Protocol with Application to Privacy Preserving Association Rule Mining
Data mining often causes privacy concerns. To ease the concerns, various privacy preserving data mining techniques have been proposed. However, those techniques are often too computationally intensive to be deployed in practice. Efficiency becomes a major challenge in privacy preserving data mining. In this paper we present an efficient secure dot product protocol and show its application in pr...
متن کاملAn Improved EMASK Algorithm for Privacy-Preserving Frequent Pattern Mining
As a novel research direction, privacy-preserving data mining (PPDM) has received a great deal of attentions from more and more researchers, and a large number of PPDM algorithms use randomization distortion techniques to mask the data for preserving the privacy of sensitive data. In reality, for PPDM in the data sets, which consist of terabytes or even petabytes of data, efficiency is a paramo...
متن کاملPrivacy Preserving Data Mining
There is a tremendous increase in the research of data mining. Data mining is the process of extraction of data from large database. Knowledge Discovery in database (KDD) is another name of data mining. Privacy protection has become a necessary requirement in many data mining applications due to emerging privacy legislation and regulations. One of the most important topics in research community...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016